Analyzing Persian Social Networks: An Empirical Study

Authors

  • Leila Esmaeili
  • Mahdi Nasiri
  • Behrouz Minaei-Bidgoli
Abstract

Analysis of social network data is important to researchers, sociologists, and academics. Given the size and diversity of web data in a Web 2.0 environment, analyzing this data is a challenge. Because the data serve as the input to such projects, the accuracy of the output depends directly on the quality of the input; good data allows valuable knowledge to be extracted. In this article, the authors present their experiences with preparing and preprocessing data from a Persian social network. They also report on the analysis of the data and their findings.

DOI: 10.4018/jvcsn.2011070104. International Journal of Virtual Communities and Social Networking, 3(3), 73-92, July-September 2011. Copyright © 2011, IGI Global.

... are conducted without preprocessing due to the difficulty in collecting large amounts of data. Sometimes studies are conducted using smaller data sets to make preprocessing faster. In our review of research on Persian social networks, we did not come across any study that included complete preprocessing of data. Most of the research was conducted using blogs. Studies were based on data that had been collected by other researchers, or in some cases Persian data was translated into English and then preprocessed (Sheykh Esmaili, Jamali, Neshati, Abolhassani, & Soltan-Zadeh, 2006; Sahebi, Oroumchian, & Khosravi, 2008). Esmaeili, Nasiri, and Minaei-Bidgoli (2011) used data stored in the database of the Parsi-yar Persian social network to personalize group recommendations for its users. The database studied consisted of content data and, to some extent, structured data. The data set was raw and was being analyzed for the first time. Complete preprocessing of a large volume of data from a Persian social network for the first time, together with the classification of textual features, is among the study's strengths.
In this study, we elaborate on some preprocessing experiments and provide details of the statistical and network analysis of the data set. The data set comes from a Persian social network called Parsi-yar and covers 5 years and 6 months of activity by 78,467 users, 3,359 groups within 19 categories, and 275 groups without a specified category (Table 1). The data can be classified into three categories: user information, group information, and other information. The category of other information includes user interactions in the network, their public and private messages, users' comments on messages, their friends, and their groups.

The rest of the paper is organized as follows. In the next section we explain the framework of data processing. This is followed by statistical analysis of the research data and by network analysis of the data set. In the final section we provide a conclusion and areas of future research.

DATA PREPROCESSING FRAMEWORK

Users usually register for membership of social networks without providing complete information about themselves. They either supply data for the required items only or enter worthless or meaningless information, and only a few of them correct this in subsequent visits. This has been attributed to factors such as privacy concerns or the rush to register quickly. Thus, there is always the problem of missing or worthless data in social network databases. Since databases usually contain data in different fields with different data types, each of these data types demands different preprocessing techniques. Determining the best preprocessing techniques for each category of data type plays an important role in improving the quality and ...

Table 1. Subjective classification of groups (active and inactive groups)

Category       Num. of groups   Category        Num. of groups   Category                    Num. of groups
Revolution     100              Sport           167              History                     37
Social         716              Entertainment   332              Literature                  117
Sciences       191              Game            75               Morality and Spirituality   129
Youth          72               Familial        28               News                        71
Geography      41               Art             182              Hygiene                     30
Buy and Sell   31               Religion        335              Computer                    352
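
The point made under DATA PREPROCESSING FRAMEWORK, that registration data is often missing or meaningless and that each data type needs its own preprocessing, can be illustrated with a minimal Python sketch. This is not the authors' procedure: the field names (name, city, age), the placeholder list, and the plausible age range are all hypothetical choices made only for illustration.

```python
import re
from typing import Optional

# Hypothetical placeholder strings a user might type to get past a required field.
PLACEHOLDERS = {"", "-", "n/a", "na", "none", "test", "asdf", "xxx"}


def clean_text(value: Optional[str]) -> Optional[str]:
    """Return a trimmed string, or None if the value is missing or worthless."""
    if value is None:
        return None
    text = value.strip()
    if text.lower() in PLACEHOLDERS:
        return None
    # A single character repeated (e.g. "aaaa") carries no information.
    if len(text) > 1 and len(set(text)) == 1:
        return None
    return text


def clean_age(value: Optional[str], low: int = 10, high: int = 100) -> Optional[int]:
    """Return the age as an int, or None if it is missing or implausible.

    The bounds `low` and `high` are assumed values, not taken from the paper.
    """
    if value is None or not re.fullmatch(r"\d{1,3}", value.strip()):
        return None
    age = int(value.strip())
    return age if low <= age <= high else None


# Each (hypothetical) field is mapped to the cleaner that matches its data type.
CLEANERS = {"name": clean_text, "city": clean_text, "age": clean_age}


def preprocess(record: dict) -> dict:
    """Apply the type-specific cleaner to every known field of one user record."""
    return {field: cleaner(record.get(field)) for field, cleaner in CLEANERS.items()}
```

Calling preprocess on a raw record returns the same fields with missing or worthless values replaced by None, so later statistical analysis can count or impute them explicitly instead of treating noise as data.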

Similar resources

Analyzing Tools and Algorithms for Privacy Protection and Data Security in Social Networks

The purpose of this research is to study factors influencing privacy concerns about data security and protection on social network sites and their influence on self-disclosure. 100 articles about privacy protection, data security, information disclosure, and information leakage on social networks were studied. Models and algorithms types and their repetition in articles have been distinguished a...

Analyzing Correlation between Internationalization Orientation and Social Network

Research on social networks and collaborative strategies has been prominent since the mid-1980s and has contributed to the success and development of firms. Relationships and communication with overseas trade partners help firms succeed in entering foreign markets and in finding new partners and new markets abroad. Firm internationalization in foreign countries faces some ba...

A committee machine approach for predicting permeability from well log data: a case study from a heterogeneous carbonate reservoir, Balal oil Field, Persian Gulf

The permeability prediction problem has been examined using several methods, such as empirical formulas, regression analysis, and intelligent systems, especially neural networks and fuzzy logic. This study proposes an improved and novel model for predicting permeability from conventional well log data. The methodology integrates empirical formulas, multiple regression, and neuro-fuzzy in a commi...

Using an Evaluator Fixed Structure Learning Automata in Sampling of Social Networks

Social networks are streaming and diverse and include a wide range of edges; they continuously evolve over time and are formed by the activities among users (such as tweets, emails, etc.), where each activity among users adds an edge to the network graph. Despite their popularity, the dynamicity and large size of most social networks make it difficult or impossible to study the entire networ...

Perception and Content Assessment of Active Users: Russian Language Social Networks

The paper considers studying the perception and assessment of media content in the Russian-language social networks, analyzing the causes that affect the perception and distribution of network content. The importance of language learning and communication in Russian-language social networks, and problems of content effectiveness is determined by the growth in the number and activity of Runet us...

Journal title:
  • IJVCSN

Volume 3  Issue 

Pages  -

Publication date 2011